SEQUENCE LISTING



<110> BOEHRINGER INGELHEIM (CANADA) LTD. 

<120> SELF REPLICATING RNA MOLECULE FROM
HEPATITIS C VIRUS 

<130> 13/083-3-D2 

<140> 10789355 
<141> 2004-02-27 



<150> 60/257,857
<151> 2000-12-22 

<150> 10/029,907
<151> 2001-12-21 

<160> 30 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 8639 
<212> DNA 
<213> HCV 

<220> 
<221> CDS 

<222> (1803)...(8408)
<400> 1 

ggccagcccc cgattggggg cgacactcca ccatagatca ctcccctgtg aggaactact 60 

gtcttcacgc agaaagcgtc tagccatggc gttagtatga gtgtcgtgca gcctccagga 120 

ccccccctcc cgggagagcc atagtggtct gcggaaccgg tgagtacacc ggaattgcca 180 

ggacgaccgg gtcctttctt ggatcaaccc gctcaatgcc tggagatttg ggcgtgcccc 240 

cgcgagactg ctagccgagt agtgttgggt cgcgaaaggc cttgtggtac tgcctgatag 300 

ggtgcttgcg agtgccccgg gaggtctcgt agaccgtgca ccatgagcac gaatcctaaa 360 

cctcaaagaa aaaccaaagg gcgcgccatg attgaacaag atggattgca cgcaggttct 420 

ccggccgctt gggtggagag gctattcggc tatgactggg cacaacagac aatcggctgc 480 

tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc cggttctttt tgtcaagacc 540 

gacctgtccg gtgccctgaa tgaactgcag gacgaggcag cgcggctatc gtggctggcc 600 

acgacgggcg ttccttgcgc agctgtgctc gacgttgtca ctgaagcggg aagggactgg 660 

ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat ctcaccttgc tcctgccgag 720 

aaagtatcca tcatggctga tgcaatgcgg cggctgcata cgcttgatcc ggctacctgc 780 

ccattcgacc accaagcgaa acatcgcatc gagcgagcac gtactcggat ggaagccggt 840 

cttgtcgatc aggatgatct ggacgaagag catcaggggc tcgcgccagc cgaactgttc 900 

gccaggctca aggcgcgcat gcccgacggc gaggatctcg tcgtgaccca tggcgatgcc 960 

tgcttgccga atatcatggt ggaaaatggc cgcttttctg gattcatcga ctgtggccgg 1020 

ctgggtgtgg cggaccgcta tcaggacata gcgttggcta cccgtgatat tgctgaagag 1080 

cttggcggcg aatgggctga ccgcttcctc gtgctttacg gtatcgccgc tcccgattcg 1140 

cagcgcatcg ccttctatcg ccttcttgac gagttcttct gagttcgcgc ccagatgtta 1200 

acagaccaca acggtttccc tctagcggga tcaattccgc ccccccccct aacgttactg 1260 

gccgaagccg cttggaataa ggccggtgtg cgtttgtcta tatgttattt tccaccatat 1320 

tgccgtcttt tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc 1380 



ctaggggtct ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag 1440 

cagttcctct ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc 1500 

ggaacccccc acctggcgac aggtgcctct gcggccaaaa gccacgtgta taagatacac 1560 

ctgcaaaggc ggcacaaccc cagtgccacg ttgtgagttg gatagttgtg gaaagagtca 1620 

aatggctctc ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt 1680 

gtatgggatc tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa 1740 

aaaacgtcta ggccccccga accacgggga cgtggttttc ctttgaaaaa cacgataata 1800 

cc atg gac cgg gag atg gca gca tcg tgc gga ggc gcg gtt ttc gta 1847

Met Asp Arg Glu Met Ala Ala Ser Cys Gly Gly Ala Val Phe Val
1 5 10 15



ggt ctg ata ctc ttg acc ttg tca ccg cac tat aag ctg ttc ctc gct 1895
Gly Leu Ile Leu Leu Thr Leu Ser Pro His Tyr Lys Leu Phe Leu Ala
20 25 30 



agg ctc ata tgg tgg tta caa tat ttt atc acc agg gcc gag gca cac 1943
Arg Leu Ile Trp Trp Leu Gln Tyr Phe Ile Thr Arg Ala Glu Ala His
35 40 45 



ttg caa gtg tgg atc ccc ccc ctc aac gtt cgg ggg ggc cgc gat gcc 1991
Leu Gln Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly Arg Asp Ala
50 55 60 



gtc atc ctc ctc acg tgc gcg atc cac cca gag cta atc ttt acc atc 2039
Val Ile Leu Leu Thr Cys Ala Ile His Pro Glu Leu Ile Phe Thr Ile
65 70 75 



acc aaa atc ttg ctc gcc ata ctc ggt cca ctc atg gtg ctc cag gct 2087
Thr Lys Ile Leu Leu Ala Ile Leu Gly Pro Leu Met Val Leu Gln Ala
80 85 90 95 



ggt ata acc aaa gtg ccg tac ttc gtg cgc gca cac ggg ctc att cgt 2135
Gly Ile Thr Lys Val Pro Tyr Phe Val Arg Ala His Gly Leu Ile Arg
100 105 110 



gca tgc atg ctg gtg cgg aag gtt gct ggg ggt cat tat gtc caa atg 2183
Ala Cys Met Leu Val Arg Lys Val Ala Gly Gly His Tyr Val Gln Met
115 120 125 



gct ctc atg aag ttg gcc gca ctg aca ggt acg tac gtt tat gac cat 2231
Ala Leu Met Lys Leu Ala Ala Leu Thr Gly Thr Tyr Val Tyr Asp His 
130 135 140 



ctc acc cca ctg cgg gac tgg gcc cac gcg ggc cta cga gac ctt gcg 2279
Leu Thr Pro Leu Arg Asp Trp Ala His Ala Gly Leu Arg Asp Leu Ala 
145 150 155 



gtg gca gtt gag ccc gtc gtc ttc tct gat atg gag acc aag gtt atc 2327
Val Ala Val Glu Pro Val Val Phe Ser Asp Met Glu Thr Lys Val Ile
160 165 170 175 



acc tgg ggg gca gac acc gcg gcg tgt ggg gac atc atc ttg ggc ctg 2375
Thr Trp Gly Ala Asp Thr Ala Ala Cys Gly Asp Ile Ile Leu Gly Leu
180 185 190 



ccc gtc tcc gcc cgc agg ggg agg gag ata cat ctg gga ccg gca gac 2423
Pro Val Ser Ala Arg Arg Gly Arg Glu Ile His Leu Gly Pro Ala Asp
195 200 205



agc ctt gaa ggg cag ggg tgg cga ctc ctc gcg cct att acg gcc tac 2471

Ser Leu Glu Gly Gln Gly Trp Arg Leu Leu Ala Pro Ile Thr Ala Tyr

210 215 220 

tcc caa cag acg cga ggc cta ctt ggc tgc atc atc act agc ctc aca 2519

Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr

225 230 235 

ggc cgg gac agg aac cag gtc gag ggg gag gtc caa gtg gtc tcc acc 2567

Gly Arg Asp Arg Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr

240 245 250 255 

gca aca caa tct ttc ctg gcg acc tgc gtc aat ggc gtg tgt tgg act 2615 

Ala Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr

260 265 270 

gtc tat cat ggt gcc ggc tca aag acc ctt gcc ggc cca aag ggc cca 2663

Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro 
275 280 285 

atc acc caa atg tac acc aat gtg gac cag gac ctc gtc ggc tgg caa 2711

Ile Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln

290 295 300 

gcg ccc ccc ggg gcg cgt tcc ttg aca cca tgc acc tgc ggc agc tcg 2759

Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser 

305 310 315 

gac ctt tac ttg gtc acg agg cat gcc gat gtc att ccg gtg cgc cgg 2807

Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg

320 325 330 335 

cgg ggc gac agc agg ggg agc cta ctc tcc ccc agg ccc gtc tcc tac 2855

Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr 

340 345 350 

ttg aag ggc tct tcg ggc ggt cca ctg ctc tgc ccc tcg ggg cac gct 2903

Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala 
355 360 365 

gtg ggc atc ttt cgg gct gcc gtg tgc acc cga ggg gtt gcg aag gcg 2951

Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala

370 375 380 

gtg gac ttt gta ccc gtc gag tct atg gaa acc act atg cgg tcc ccg 2999

Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg Ser Pro 

385 390 395 

gtc ttc acg gac aac tcg tcc cct ccg gcc gta ccg cag aca ttc cag 3047

Val Phe Thr Asp Asn Ser Ser Pro Pro Ala Val Pro Gln Thr Phe Gln

400 405 410 415 

gtg gcc cat cta cac gcc cct act ggt agc ggc aag agc act aag gtg 3095

Val Ala His Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val 

420 425 430 



ccg gct gcg tat gca gcc caa ggg tat aag gtg ctt gtc ctg aac ccg 3143

Pro Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro

435 440 445 

tcc gtc gcc gcc acc cta ggt ttc ggg gcg tat atg tct aag gca cat 3191

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His 

450 455 460 

ggt atc gac cct aac atc aga acc ggg gta agg acc atc acc acg ggt 3239

Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly

465 470 475 

gcc ccc atc acg tac tcc acc tat ggc aag ttt ctt gcc gac ggt ggt 3287

Ala Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly

480 485 490 495 

tgc tct ggg ggc gcc tat gac atc ata ata tgt gat gag tgc cac tca 3335

Cys Ser Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser

500 505 510 

act gac tcg acc act atc ctg ggc atc ggc aca gtc ctg gac caa gcg 3383

Thr Asp Ser Thr Thr Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala

515 520 525 

gag acg gct gga gcg cga ctc gtc gtg ctc gcc acc gct acg cct ccg 3431

Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro 

530 535 540 

gga tcg gtc acc gtg cca cat cca aac atc gag gag gtg gct ctg tcc 3479

Gly Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser

545 550 555 

agc act gga gaa atc ccc ttt tat ggc aaa gcc atc ccc atc gag acc 3527

Ser Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Ile Glu Thr

560 565 570 575 

atc aag ggg ggg agg cac ctc att ttc tgc cat tcc aag aag aaa tgt 3575

Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys

580 585 590 

gat gag ctc gcc gcg aag ctg tcc ggc ctc gga ctc aat gct gta gca 3623

Asp Glu Leu Ala Ala Lys Leu Ser Gly Leu Gly Leu Asn Ala Val Ala 

595 600 605 

tat tac cgg ggc ctt gat gta tcc gtc ata cca act agc gga gac gtc 3671

Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val

610 615 620 

att gtc gta gca acg gac gct cta atg acg ggc ttt acc ggc gat ttc 3719

Ile Val Val Ala Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe

625 630 635 

gac tca gtg atc gac tgc aat aca tgt gtc acc cag aca gtc gac ttc 3767

Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe

640 645 650 655 



agc ctg gac ccg acc ttc acc att gag acg acg acc gtg cca caa gac 3815

Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr Thr Val Pro Gln Asp

660 665 670 

gcg gtg tca cgc tcg cag cgg cga ggc agg act ggt agg ggc agg atg 3863

Ala Val Ser Arg Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Met
675 680 685 

ggc att tac agg ttt gtg act cca gga gaa cgg ccc tcg ggc atg ttc 3911

Gly Ile Tyr Arg Phe Val Thr Pro Gly Glu Arg Pro Ser Gly Met Phe

690 695 700 

gat tcc tcg gtt ctg tgc gag tgc tat gac gcg ggc tgt gct tgg tac 3959

Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr 

705 710 715 

gag ctc acg ccc gcc gag acc tca gtt agg ttg cgg gct tac cta aac 4007

Glu Leu Thr Pro Ala Glu Thr Ser Val Arg Leu Arg Ala Tyr Leu Asn 

720 725 730 735 

aca cca ggg ttg ccc gtc tgc cag gac cat ctg gag ttc tgg gag agc 4055

Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ser

740 745 750 

gtc ttt aca ggc ctc acc cac ata gac gcc cat ttc ttg tcc cag act 4103

Val Phe Thr Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr
755 760 765 

aag cag gca gga gac aac ttc ccc tac ctg gta gca tac cag gct acg 4151

Lys Gln Ala Gly Asp Asn Phe Pro Tyr Leu Val Ala Tyr Gln Ala Thr

770 775 780 

gtg tgc gcc agg gct cag gct cca cct cca tcg tgg gac caa atg tgg 4199

Val Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp

785 790 795 

aag tgt ctc ata cgg cta aag cct acg ctg cac ggg cca acg ccc ctg 4247

Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu

800 805 810 815 

ctg tat agg ctg gga gcc gtt caa aac gag gtt act acc aca cac ccc 4295

Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Val Thr Thr Thr His Pro

820 825 830 

ata acc aaa tac atc atg gca tgc atg tcg gct gac ctg gag gtc gtc 4343

Ile Thr Lys Tyr Ile Met Ala Cys Met Ser Ala Asp Leu Glu Val Val
835 840 845 

acg agc acc tgg gtg ctg gta ggc gga gtc cta gca gct ctg gcc gcg 4391

Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 

850 855 860 

tat tgc ctg aca aca ggc agc gtg gtc att gtg ggc agg atc atc ttg 4439

Tyr Cys Leu Thr Thr Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu

865 870 875 



tcc gga aag ccg gcc atc att ccc gac agg gaa gtc ctt tac cgg gag 4487

Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu
880 885 890 895



ttc gat gag atg gaa gag tgc gcc tca cac ctc cct tac atc gaa cag 4535

Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Ile Glu Gln

900 905 910 

gga atg cag ctc gcc gaa caa ttc aaa cag aag gca atc ggg ttg ctg 4583

Gly Met Gln Leu Ala Glu Gln Phe Lys Gln Lys Ala Ile Gly Leu Leu

915 920 925 

caa aca gcc acc aag caa gcg gag gct gct gct ccc gtg gtg gaa tcc 4631

Gln Thr Ala Thr Lys Gln Ala Glu Ala Ala Ala Pro Val Val Glu Ser

930 935 940 

aag tgg cgg acc ctc gaa gcc ttc tgg gcg aag cat atg tgg aat ttc 4679

Lys Trp Arg Thr Leu Glu Ala Phe Trp Ala Lys His Met Trp Asn Phe 

945 950 955 

atc agc ggg ata caa tat tta gca ggc ttg tcc act ctg cct ggc aac 4727

Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn

960 965 970 975 

ccc gcg ata gca tca ctg atg gca ttc aca gcc tct atc acc agc ccg 4775

Pro Ala Ile Ala Ser Leu Met Ala Phe Thr Ala Ser Ile Thr Ser Pro

980 985 990 

ctc acc acc caa cat acc ctc ctg ttt aac atc ctg ggg gga tgg gtg 4823

Leu Thr Thr Gln His Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val

995 1000 1005 

gcc gcc caa ctt gct cct ccc agc gct gct tct gct ttc gta ggc gcc 4871

Ala Ala Gln Leu Ala Pro Pro Ser Ala Ala Ser Ala Phe Val Gly Ala

1010 1015 1020 

ggc atc gct gga gcg gct gtt ggc agc ata ggc ctt ggg aag gtg ctt 4919

Gly Ile Ala Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu

1025 1030 1035 

gtg gat att ttg gca ggt tat gga gca ggg gtg gca ggc gcg ctc gtg 4967

Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val

1040 1045 1050 1055 

gcc ttt aag gtc atg agc ggc gag atg ccc tcc acc gag gac ctg gtt 5015

Ala Phe Lys Val Met Ser Gly Glu Met Pro Ser Thr Glu Asp Leu Val 

1060 1065 1070 

aac cta ctc cct gct atc ctc tcc cct ggc gcc cta gtc gtc ggg gtc 5063

Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val

1075 1080 1085 

gtg tgc gca gcg ata ctg cgt cgg cac gtg ggc cca ggg gag ggg gct 5111

Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala

1090 1095 1100 



gtg cag tgg atg aac cgg ctg ata gcg ttc gct tcg cgg ggt aac cac 5159
Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His
1105 1110 1115



gtc tcc ccc acg cac tat gtg cct gag agc gac gct gca gca cgt gtc 5207

Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val 

1120 1125 1130 1135 

act cag atc ctc tct agt ctt acc atc act cag ctg ctg aag agg ctt 5255

Thr Gln Ile Leu Ser Ser Leu Thr Ile Thr Gln Leu Leu Lys Arg Leu

1140 1145 1150 

cac cag tgg atc aac gag gac tgc tcc acg cca tgc tcc ggc tcg tgg 5303

His Gln Trp Ile Asn Glu Asp Cys Ser Thr Pro Cys Ser Gly Ser Trp

1155 1160 1165 

cta aga gat gtt tgg gat tgg ata tgc acg gtg ttg act gat ttc aag 5351

Leu Arg Asp Val Trp Asp Trp Ile Cys Thr Val Leu Thr Asp Phe Lys

1170 1175 1180 

acc tgg ctc cag tcc aag ctc ctg ccg cga ttg ccg gga gtc ccc ttc 5399

Thr Trp Leu Gln Ser Lys Leu Leu Pro Arg Leu Pro Gly Val Pro Phe

1185 1190 1195 

ttc tca tgt caa cgt ggg tac aag gga gtc tgg cgg ggc gac ggc atc 5447

Phe Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile

1200 1205 1210 1215 

atg caa acc acc tgc cca tgt gga gca cag atc acc gga cat gtg aaa 5495

Met Gln Thr Thr Cys Pro Cys Gly Ala Gln Ile Thr Gly His Val Lys
1220 



